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Abstract. The idea of modern urban systems and smart cities requires 
monitoring and careful analysis of different signals. Such signals can orig¬ 
inate from different sources and one of the most promising is the BTS, 
i.e. base transceiver station, an element of mobile carrier networks. This 
paper presents the fundamental problems of elicitation, classification and 
understanding of such signals so as to develop context-aware and pro¬ 
active systems in urban areas. These systems are characterized by the 
omnipresence of computing which is strongly focused on providing on¬ 
line support to users/inhabitants of smart cities. A method of analyzing 
selected elements of mobile phone datasets through understanding in¬ 
habitants’ behavioral fingerprints to obtain smart scenarios for public 
transport is proposed. Some scenarios are outlined. A multi-agent sys¬ 
tem is proposed. A formalism based on graphs that allows reasoning 
about inhabitant behaviors is also proposed. 
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1 Introduction 
D 

We face today an unprecedented interest for the idea of smart cities. This idea 
requires smart analysis of many signals and information bits which are generated 
in urban areas, as well as the use of network facilities and interaction of citizens 
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through new technologies. Thus, new and innovative ways of analyzing behaviors 
in cities through understanding the data they generate are needed. New ways 
to analyze and classify this data, as well as further reasoning, in order to better 
understand and plan pro-active support offered by systems to inhabitants are a 
crucial necessity. Pervasive computing is an idea which assumes the omnipresence 
of computer systems to give strong support for inhabitants in smart cities. These 
systems must be characterized by context-awareness, basing on different urban 
signals, to provide pro-active assistance for inhabitants. 

Widespread availability and use of mobile phones, as well as their growing 
ubiquity, is based on BTS wireless networks which guarantee basic communica¬ 
tion in the system. Wireless networks have great potential to provide information 
to identify activities of people. People keep a phone with them most of the time. 
Inhabitant movements and locations are being recorded in many different ways. 
BTSs are responsible for communicating with mobile phones within the network. 
They record many important and useful events, stored in the CDR format, for 
example about the presence of a phone device, which gives an indication of the 
geographic location of the user. Thus, it allows to identify places of inhabitants’ 
lives using mobile networks. Being able to identify users’ movements is crucial 
for smart decisions as well as to work fast and get results in a short time. 

We show that analyzing selected information generated by BTS devices can 
indeed identify inhabitants’ behaviors, help understand human mobility and so¬ 
cial patterns and implement smart scenarios for software systems. A classifica¬ 
tion of sensed behaviors for applications that operate in a smart city is proposed. 
Outlines of smart scenarios are provided. A multi agent system is proposed. A 
formalism which allows reasoning about inhabitant behaviors in the BTS net¬ 
work is proposed. 

The topic of sensing and monitoring urban activities basing on mobile phone 
datasets seems hot and relatively new. In work by Calabresse et al. [T] a real 
time monitoring system is described. Buses and taxis, as well as pedestrians 
movements, are positioned providing urban mobility. In work by Gonzalez et 
al. [3] trajectories of anonymized mobile phone owners are discussed. Human 
trajectories are characterized by a high degree of both temporal and spatial reg¬ 
ularity. Work by Isaacman et al. [S] proposes clustering and regression-oriented 
techniques supporting identification semantically-meaningful locations (home, 
work). Work by Reades et al. [T2] offers a new way of looking at the city as a 
holistic and dynamic system. Some experiments in explorations in urban data 
collection are discussed. 


2 Technical preliminaries 

Systems for mobile communications (e.g. GSM or UMTS) are now well estab¬ 
lished. There are many works introducing in the world of data communication 
procedures, e.g. work [3]. Selected technical aspects of such system are briefly 
outlined below. 
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The most obvious part of the cellular/mobile phone network is a base station. 
A base transceiver station (BTS) is a piece of equipment that enables wireless 
communication between the user and the network. Every BTS performs immedi¬ 
ate communication with mobile phones. Nowadays, cities and regions are covered 
with a relatively dense network of BTSs, see for example Figure Although 



Fig. 1. A sample BTS city network (source: http://btsearch.pl/) 


outside the cities networks are less dense, in each case they gather and store 
important and interesting information about users’ activities. Broadly speaking, 
the entire network system consists of many elements that operate together but 
an ordinary user is not aware of the different entities within the system. 

A call detail record (CDR) is contains data recorded and produced by telecom¬ 
munications equipment. The purpose is to store information about current sys¬ 
tem usage; however, CDR is rather retrospective. It contains data that is specific 
to a single instance of a phone call or other communication transaction. The 
structure of CDR is relatively complex, and its format varies among providers. 
In some situations CDR can also be configured by a user. There is an entry done 
for each call, at the start of a call and at the end of the call. The management 
system is usually configured to update the CDRs periodically. The CDR contains 
variables, e.g. the called number. Variables might be grouped into: variables used 
for identifying calls, timestamps, information related to signaling, information 
related to media, statistics, information related to routing, and others. Records 
are very detailed and contain much information, e.g. point of origin (sources), 
points of destination (endpoints), the phone number of the calling party, the 
phone number of the party being called, duration of each call, the amount billed 
for each call, the route for a call entering the exchange, the route for a call 
leaving the exchange, call type (voice, SMS, etc.), etc. Some data depends on 
the service provider and even in a case of timestamps there are over a dozen of 
different fields. 

CDRs, as collections of information, have a special format [5]. Below is a 
sample fragment of a CDR text decoded from the binary format. The first row 
must contain a header row which includes the field names: 

Call Type,Call Cause,Customer Identifier,Telephone Number Dialled, 
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Call Date,Call Time.Duration,Bytes Transmitted,Bytes Received, 
Description,Chargecode.Time Band,Salesprice, 

Salesprice (pre-bundle), Extension.DDl.Grouping ID,Call Class, 
Carrier.Recording,VAT,Country of Origin,Network, 

Retail tariff code,Remote Network,APN,Diverted Number, 

Ring time,RecordlD,Currency 

The meaning of the columns is not analyzed here since they are intuitive and the 
detailed discussion is outside the scope of the paper. Below is a sample decoded 
text: 


"V","0","+441999887000","+441999878333","28/01/2012", 
"233","","","Hampton","UK Local","Peak","0.8","0.8"," 
"+441999887654","","UKL","Talk Talk","","S","","","", 
"","778789","GBP" 


"10:37:23" 
654" , 

n II II II II II 
iff 


II 


II 


VOIP", "0" ,"BrianbOMl. com"," 
345","","" ,"On-Net","On-Net 
Talk Talk","1","S", 


+442086019080","28/01/2012","10:39:23" 
","Peak","0.0","0.0""ON" , 
",i".^Mi,^g0ii229,"GBP" 


There are many events that generate a CDR record, e.g. data services, such 
as SMS and Internet access. The gathered information allows to obtain BTS 
locations according to a mobile phone activity, i.e. changing a location from one 
BTS to another. Location information is extracted as part of the interaction 
data. These location observations, i.e. 

— the moment of the phone’s/object’s entry into the area of a station (log in), 
and 

— the moment of they leave that area (log out), 

are of fundamental importance to the considerations given in the following sec¬ 
tions of the paper. 


3 Behaviour recognition and classification 

This work discusses the possibility of analyzing data generated and obtained 
from BTS devices/stations. Such stations constitute a rich source of information 
for smart and context-aware systems. This information is related to many aspects 
of users’/inhabitants’ behaviors and base on relatively raw data. From all the 
information generated by a BTS, the most important for these considerations 
are events describing the presence/location (login/logout) of the phone in the 
BTS area. 

Information, or events, obtained from the BTS network can be used to pro¬ 
vide the following classification of user behaviors: 

1. Static behavior, that is without moving outside the BTS area. Some sce¬ 
narios which are appropriate for such behavior are proposed. The aim of 
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such scenarios is to increase the user’s/inhabitant’s comfort of staying in a 
particular area. Appropriate algorithms can take the behaviors registered in 
the past into account which can build a suitable preference model |9l6j . For 
example, providing information about local customer services, shops or spe¬ 
cial offers. If preferences (behavior in the past) also also taken into account, 
then support for users/inhabitants staying in the area becomes more mature 
and valuable. For example, people working in local offices, when approaching 
the habitual and observed time for their lunch, are notified about current 
opportunities in their neighborhood, and table bookings in restaurants are 
suggested, offering personalized ads, etc. In other words, these actions are 
performed after gaining a deep understanding of the inhabitants’ needs. 

The algorithm to identify home/residential or work/office locations might 
be based on the tracking the entire mobile activity during the selected days 
of a week and selected time of a day. In other words, home is defined when 
the mobile phone location is registered after a certain time at night during 
certain days. 

Another example of such support might be the situation when two people 
are informed about the possibility of their meeting as a result of being in 
the same geographic location, if such a meeting has been “ordered” before 
(e.g.: when I am in the same area as the person X, please notify and make 
an appointment). 

2. Dynamic, or mobile, behavior, that is related to the movement of both 
individual users or groups of inhabitants, between neighboring BTS points. 
It seems that dynamic behaviors, as understood here, give a great number of 
possibilities to introduce pro-active scenarios. Such scenarios are especially 
important for the ideas of pervasive computing and smart cities. The desired 
effect is particularly evident when applying some additional, and free, tech¬ 
nologies related to the geographical location and maps, e.g. OpenStreetMap 
0SM[^ or maps of existing urban infrastructure networks, e.g. public trans¬ 
port lines, c.f. Fig. Dynamic behaviors, due to their great potential for 
interesting uses, are to be discussed separately. 

Scenarios related to dynamic behaviors are of fundamental importance in the 
paper. Further considerations are focused on the classification of different types 
of travel in the urban area. The purpose of this classihcation is to distinguish 
two kinds of situations which relate to the observed quick move: 

1. a group of people traveling by public transport, i.e. simultaneously traveling 
groups of persons (phones) after finding that this is not a solitary case of 
traveling by private cars - confirmation of this case is a result of the following 
observations: the comparison to a similar behavior in the past, quick change 
of BTS areas, i.e. switching between BTSs, and comparison of the current 
travel route with public transport lines, c.f. Fig. 

2. people traveling by private cars - evidence of this case might be a result 
of the following observations: a greater speed of a travel comparing groups 

^ OSM is a project to create a free editable map of the world. 
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Fig. 2. A sample city tram/bus network (source: http://www.mpk.krakow.pl/pl/ 
mapki-komunikacyj ne/) 


traveling by public transport, traveling outside the area of public transport 
lines, etc. 

When the above basic classihcation is done, the following support for in¬ 
habitants is considered as a result of a performance of the context-aware and 
pro-active system: 

1. (group) trip by public transport 

— finding convenient transfers for travelers if transfers are expected; 

— in the case of transfers with a long wait for a new connection: finding 
bar/cafeteria facilities in the area in advance to make a reservation; 

— propose to notify people at home (destination) about the arrival time, or 
notifying of the planned arrival in advance a certain number of minutes 
before; 

— hnding alternative connections and transfers, if there are traffic jams 
which slow down a trip or make it difficult; 

— notification of friends/colleagues about a common trip in the same ve¬ 
hicle of public transport, this fact can be confirmed by on-line analysis 
of social networks (e.g.: Facebook, Instagram, etc.); 

— some others; 

2. (individual) trip by a car 

— propose to notify people at home about the arrival time, or notifying of 
the planned arrival in advance a certain number of minutes before; 

— warning regarding the approaching a critical locations, schools, places, 
crossroads, etc., this service requires gathering additional data form 
OpenStreetMap; 

— some others. 

4 System architecture 

In this section we introduce the agent system structure that supports the loT 
services mentioned in the previous sections. Let us consider the structure of a 
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system supporting the simple task of determination of the way in which the owner 
of a mobile phone travels, that is whether travel is done via public transportation 
or via private car. 

The outline of the proposed agent system is shown in Fig.[^ The basic types 



Fig. 3. A sample agent system for a public transportation system 


of agents are: 

— a personal agent PA, that maintains the trace of the current route of a mobile 
phone in the entire city among BTS areas, as well as its stored characteristic, 
taking into account some historical information about previous behaviors; 
every mobile phone owner has its personal agent PA (informally, a guardian 
angel PA); 

— a BTS agent BTS-A for every base transceiver station BTS, that, for every 
personal agent PA that came into the BTS area at the same time slot, creates 
a temporary coordination agent CA which stamps the time of the entry to 
an area and try to characterize every PA’s behaviour in the following way: 

1. if jump among (at least three) regions is slow, then walking, 

2. if jump among (at least three) regions is medium, then private car or 
public transportation, 

3. if jump among (at least three) regions is fast, then private car; 

the speed (slow, medium, fast) is determined taking into account speeds 
observed in the considered area; every agent CA is removed/killed when its 
reasoning process, initiated by an agent BTS-A for a list of jumping PA 
agents passed to CA is finished; 
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— a public transportation manager agent PTlvf] that tries to recognize and 
represent the group of personal agents PAs that move in the same line. 

The following are rules for the CA agent creation algorithm: 

— for every pair of neighboring BTS regions the BTS generates a list of PA 
agents which passed/jumped between two regions in a given time slot; 

— the CA agent is created and the list of jumping agents PAs constitute its 
input; CA gathers information about the trace of previous travel and creates 
a travel graph for all PA agents considered by CA. 

Let us consider a graph G = (P, E) , c.f. [TT] , where vertices V are parts of 
the BTS state which maintains the collection of PA agents that jump to this 
BTS from other BTS at a certain time, and E shows the movement between 
nodes, i.e. (u, w) G E. Nodes in the travel tree represents the BTS in time t, so 
we will describe it as a pair {BTSiD,t) - such node will be called node at level 
t. Initially, at time t, there is only one node for which the CA agent has been 
created. At level t—1, there are nodes from which an object jumps to nodes at 
level t. Edges show from which node at level f — 1 an object is moved to a node 
at level t. Each edge is labelled by the name of the moving object. 

Let us notice that the travel graph is a multi-edge graph, which means that 
more than one edge can exist between two nodes. Next we will designate the 
object using a recursive algorithm; at level t — 1 we initially assume that with 
node X = (id, t — 1) there are associated all objects that will jump from x to any 
node at the higher level. For each object Q associated with node x, agent CA 
retrieves the information about the previous traversal of Q from agent PAq; It 
should be noticed that: 

— the number of associated object can grow, because agent PAq can remember 
that at time t — 1, Q was in idsTS with other objects; 

— this travel enriches the travel graph at levels lower then t — 1. 

When we gather all the information about the route of the object from level 
t — 1, we will update the information about the nodes at level t — 2 and the 
following ones. Time is an attribute of the edge. The following rules should be 
fulfilled in the travel graph: 

— Vn, w G V : {w, PAq) G E time{v) -I- 1 = time{w); 

Vv - 

— and agent PAq moves from w to v. 

The travel graph is a multi-graph, which means that more than one edge can 
exist between the same two graphs nodes. They are differentiated by a label that 
identifies the PA agent. This graph might constitute a base for reasoning. 

^ that can be associated with some buses, if such buses are identified in the network, 
or of it represents a virtual bus - a group of personal agents that move together in 
the same destination 



Towards a better understanding and behavior recognition of inhabitants... 


9 


The decision made by the CA agent is supported by the information from 
the PTM agent that can verify if the route traveled by a PA agent can be 
covered using public transportation. Let us note that we still have a problem 
with differentiation of two situations: 

1. traveling by a bus, 

2. traveling in a column of a few/column cars. 

In such case, historical data is used to make a decision with the most probability. 
Let us note that in next steps we can determine the type of transportation 
because either it is not possible to find a bus travel in this destination or the 
column of cars has been split. A sample travel graph is shown in Figure]^ where 



{14} {9} {11} 

Fig. 4. A screen shot of a sample travel graph 


two arrows from one node to another (if any) symbolize that more than one 
edge exists between two nodes, (i.e. there are two or more edges). Analysis of 
the graph shows that agents from 51 to 71 are traveling together using the public 
transport system. Agent 14 travels by car. 

The agent system can be extended by two more types of agents: 

— a historical data maintainer agent HDM, i.e. an agent that maintains the 
historical data of personal agents; 

— a data mining agent DM, that takes the trace of route traveled by a personal 
agent and processes it into some interesting historical behavior; for example: 

• routes that are covered by walking, via public or private transportation, 

• public agents which usually travel together, 

• a schedule of routes that are executed periodically. 



10 


R. Klimek L. Kotulski 


A personal agent PA returns the information to the DM agent (after finishing 
the travel) and takes it from HDM before starting a new travel. 

Historical data maintained by some agents open an interesting issue that 
supplements the approach presented here. The historical behaviors are encoded 
into logical specifications, and can be later analyzed for satisfiability, c.f. works [3] 
or mm, supporting the current reasoning process and behavior recognition. 

5 Conclusions 

In this paper, the problem of sensing inhabitant behaviors in a smart city are 
considered. The classihcation of behaviors observed using the BTS networks is 
proposed. A public transportation case is discussed, and a multi-agent system 
is proposed. This work opens a research area which is of crucial importance for 
the idea of smart cities. 

Future works may include the implementation of the reasoning engine. It 
should result in a CASE software, which could be a first step involved in creating 
industrial-proof tools. 
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